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Abstract 


It has been well established in the literature that the appropriate criterion 
for optimum trellis coded modulation design on the additive white Gaussian 
noise channel is maximization of the free Euclidean distance. We show here 
that when trellis coded modulation is used on a Rician fading channel with 
interleaving/deinterleaving, the design of the code for optimum performance is 
guided by other factors, in particular the length of the shortest error event 
path , and the product of branch distances (possibly normalized by the 
Euclidean distance of the path) along that path . Although maximum free 
distance (d free ) is still an important consideration, it plays a less 
significant role the more severe the fading is on the channel. These 
considerations lead to the definition of a new distance measure for 
optimization of trellis codes transmitted over Rician fading channels. If no 
interleaving/deinterleaving is used, then once again the design of the trellis 
code is guided by maximizing df ree . 

It is also shown that allowing for multiple symbols per trellis branch, i.e., 
multiple trellis coded modulation (MTCM) , provides an additional degree of 
freedom for designing a code to meet the above optimization criteria on the 
fading channel. It is here where the MTCM technique exploits its full 
potential. 
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Introduction 

In previous publications [1-4], the authors have considered the performance of 
conventional and multiple trellis codes in a Rician fading environment 
characteristic of the mobile satellite channel. Results were reported for 
both the case of coherent detection and differentially coherent detection with 
and without the use of channel state information (CSI). The primary emphasis 
in these previous works was the degradation in performance produced by the 
fading for trellis codes designed to be optimum on the additive Gaussian noise 
channel (AWGN) . 

In this report, we look more carefully into the properties of the trellis 

coded modulation (TCN) that enter into the various expressions for average bit 

error probability corresponding to the above-mentioned cases and then proceed 

to use these as design criteria for conventional and multiple trellis codes 

operating over a fading channel. It is shown that, whereas maximizing free 

Euclidean distance (d. ) is the appropriate optimum design criterion on 

free 

the AWGN, over Rician fading channels with interleaving/deinterleaving, the 

asymptotic performance of TCM at high signal-to-noise ratio (SNR) is dominated 

by several other factors depending on the value of the Rician parameter K, 

i.e., the ratio of direct plus specular power (coherent components) to diffuse 

power (noncoherent component). In particular, for small values of K (the 

channel tends toward Rayleigh) , the primary design criteria for high SNR 

become: . 1) the length (to be defined in the report) of the shortest error 

event path, and 2) the product of branch distances along that path, with 

d, a secondary consideration. Thus, at low values of K, the longer is 
free 

the shortest error event path and the larger is the product of the branch 
distances along that path, the better the code will perform even though 
dj. does not achieve its optimum value over the AWGN ! As K increases, the 
significance of these primary and secondary considerations shift relative to 
I one another until K reaches infinity (AWGN) , in which case optimum performance 

| is once again achieved by a trellis code designed to maximize <* free « 

[ 

To demonstrate the above analytically, our approach will be to first take the 
previously derived [1-4] upper bounds on average error probability performance 
in the presence of fading and investigate their asymptotic behavior as SNR 

j 

i 

i 
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gets large. A comparison of these results for the different cases, i.e., 
coherent versus differentially coherent detection and CSI versus no CSI, will 
reveal some striking similarities with regard to the way in which certain 
properties of the trellis code design affect the rate of descent of average 
error probability with average SNR. When these properties are used as a 
motivation for good code design, we then show that multiple trellis coded 
modulation (MTCM) [4], wherein more than one channel symbol is assigned to 
each trellis branch, is a natural choice in this situation. In fact, we will 
show that MTCM allows us to achieve a performance on the fading channel 
superior to that achievable by a conventional (single channel symbol per 
trellis branch) TCM of the same throughput and number of trellis states. 

Since conventional TCM can be viewed as a special case of multiple TCM l A ] , we 
shall begin our detailed discussion with a description of the system model for 
the more general MTCM. 

System Model 

The system under consideration is illustrated by the block diagram in 
Figure 1. The elements indicated in dashed lines represent system functions 
that are peculiar to the form of detection, i.e., coherent versus 
differentially coherent. In particular, the differential encoder is required 
for differentially coherent detection but not coherent detection. Similarly, 
the injection of a pilot tone at the transmitter and its extraction at the 
receiver for purposes of demodulation are required for coherent detection but 
not differentially coherent detection. All other blocks have similar system 
functions for both forms of detection. 

The key elements of Figure 1, as far as our interest in this report is 
concerned, are the trellis encoder and the MPSK signal set mapping.* A 
multiple trellis encoder has b binary input bits and s binary output symbols 
which are mapped into k M-ary symbols in each transmission interval (see 


*In keeping with our previous work [1-4] on the performance of TCM over 
fading channels, we consider only MPSK modulation. 
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Figure 2). For such a transmitter, the throughput is b/k bps/Hz which has a 
unity bandwidth expansion relative to an uncoded system with a 2* >/ * t -point 
signal constellation. 

One way of producing such a result is to partition the s binary encoder output 
symbols into k groups of m = log^M symbols each. Each of these groups 
results in an MPSK output symbol. Clearly, to achieve this result, the 
transmitter parameters s, k, and M must be chosen such that s = k log 2 M. 

Another possibility is to partition the s binary encoder output into k groups 
of symbols where each group now corresponds to, in general, a different size 
MPSK signal set. Thus, if, for the ith group, nu = log 2 M i’ then we 

require s = m + m + . . . , + m . Another interpretation of this 

1 ^ 1)4*1 

requirement is in terms of the total number of multiple signals 2 used in 

the trellis diagram. Since, for any encoder, s > b+1, then m^, m 2 , ...» 

Ym* 

m^ must be chosen such that 2 1 is greater than the total number of multiple 

signals required in the trellis diagram. If M is not a power of 2 then the 

equivalent requirement is < II Hi. 

~ i=l 

In keeping with our previous work on MTCM, we shall emphasize the former 
partition, i.e., k equal size groups, but also allow for the latter partition 
in terms of an example. A complete exposition of the applications and 
advantages of partitioning into unequal group sizes is the subject of a future 
paper by the authors. 

Asymptotic Performance Analysis 

A. Ideal Interleaving/Deinterleaving 


An upper bound on the average bit error probability is obtained as 


P 


b 


< £ £a(x,x)p(x)P(x + x) 


(i) 


where a(x,x) is the number of bit errors that occur when the sequence x is 
transmitted and the sequence x / x is chosen by the decoder, p(x) is the a 
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priori probability of transmitting x, and is the set of all coded sequences. 
Also, in (1), P(x -» x) represents the pairwise error probability, i.e., the 
probability that the decoder chooses x when indeed x was transmitted. The 
upper bound of (1) is efficiently evaluated using the transfer function bound 
approach applied to TCN in [1-5]. 

Evaluation of the pairwise error probability depends on the proposed decoding 
metric, the presence or absence of CSI, and the type of detection used, i.e., 
coherent versus differentially coherent. For example, consider the case of 
coherent detection with ideal (perfect) CSI and a Gaussian decoding metric 
(correlation metric). For this case, it has been shown ([1], Equations (20) - 
(21)) that, conditioned on the fading amplitude vector £ = (p » p 2> ... p r ), 
the pairwise error probability is given by 


P(x -» x|p) < exp ■{- 


E 

s .2 * 

■ - — - d (x,x) ■ 

1 *»o J 


(2a) 


where 


d 2 (x,x) = E p \ l x n “ * n | 2 (2b) 

ncn 

represents the square of the weighted Euclidean distance between the two 

symbol sequences x and x and n is the set of all n for which x^ 4 x n » 

In (2b), p is the normalized fading amplitude for the nth transmission 
n 

interval which for Rician fading has the probability density function 
(independent of n) 


p(p) 


f 2p(l + K) exp [-K - p 2 (l + K)I 0 (2p /K(l + K>); p > 0 
\o ; otherwise 


( 3 ) 


where I Q (x) is the zero-order modified Bessel function of the first kind. 

Also, in (2a), E /IT is the M-ary channel symbol energy to noise spectral 
s 0 

density ratio. For multiple trellis coding, the symbol energy E is related 

s 

to the bit energy E. by E = (b/k)E. . 

D S D 
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For ideal interleaving/deinterleaving, the p 's in (2b) are independent 

n 

and, as noted above, identically distributed. Thus, averaging (2a) over the 
probability density function of (3) gives [1] 


p(x x) < n 


1 + K 


exp 


ncn 1 + K + — 

4N Q '"n 


X - X 


K T~ |x - x | 2 
4 N q 1 n n' 


1 + K + 


E 

|x - x | 
4 N q n n' 


(4) 


which can be written in the form 


P(x -» x) < exp 



(5a) 


with* 


d" = 


E 

ncn 


a 1 2 y r \ — 1 

x - x K / E \ 

n n' I s 1 

tt; ln 


1 + K + TT - |X - X . 

4N n n' 

>-v X > 


1 + K + ~ |x - x | 2 
4 Nq n n 


\ 


1 + K 


(5b) 


In 


2n 


Note that for K = «» (no fading). 






(6a) 


2 

and thus d is merely the sum of the squared Euclidean distances along the 
error event path. 


*Note that d satisfies the conditions for a distance metric (see Appendix A). 
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For K = 0 (Rayleigh fading), 



d 


2 

2n 



In 



(6b) 


and thus for reasonably large E s /Nq values, d^ is the sum of the 
logarithms of the squared Euclidean distances (each weighted by E g /4Ng). 

Equivalently, the upper bound on pairwise error probability for this special 
case becomes 


P(x -* x) < 




(7) 


i.e., it is proportional to the product of the squared Euclidean distances 
along the error event path. 


For values of K between 0 and », the equivalent squared Euclidean distance, 
2 

d , will be a mixture of the above two special cases. We now turn our 
attention to asymptotic behavior. 


At sufficiently high SNR, (4) simplifies to 


P(x ■* x) < n 


-K 


/ 5 s' 

i x - x r 

1 n n' 

\v 

' 4(1+K) 


( 8 ) 


Substituting (8) into (1), we get 


p t>& 


EE a(x,x) p(x) 
x.x e 


n 

ncq 


1 -K 

/ S s\ 

|x n - *n |2 

\v 

4 ( 1+K) 


( 9 ) 


whose evaluation depends on the particular trellis code design. 
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To identify the important considerations for such a design in a fading 

environment, we first observe that the upper bound of (9) will be dominated by 

the term in the summation which has the slowest rate of descent with 

G /N . This in turn corresponds to the error event path with the smallest 
s 0 

number of elements in n. We refer to this path as the "shortest error event 
path" and define it more formally as the error event path with the smallest 
number of nonzero distances between itself and the correct path. We also 
define the "length", L, of the shortest error event path by the number of 
nonzero pairwise distances between the symbols along its branches and those 
along the correct path. It is to be emphasized that pairwise distance refers 
to Euclidean distance between corresponding symbols on the pair of paths being 
compared . 


In terms of the above definitions, we see that asymptotically with high SMR, 
the average bit error probability is approximately given by* 


P 


b 



(1+K)e K 
E /N 

i s 0 , 


L 


E /N n » K 
s 0 


( 10 ) 


where C is a constant that depends on the distance structure of the code. For 
now, the important point to be observed in (10) is that P*, varies inversely 
with (I /N rt ) L . 

For conventional trellis coding, wherein each branch in the trellis 
corresponds to a single MPSK output channel symbol, the shortest error event 
path is that error event path with the fewest number of branches having 
nonzero pairwise distance from the correct path. For most cases, this also 
corresponds to the shortest length (in branches) error event and thus L is 
just the number of branches on this path. 


*The approximation in (10) stems from the fact that we consider only a single 
term in (9), namely, that due to the shortest (in length) error event path. 
Also, for simplicity, we shall ignore the number of such paths in the 
computation of C. As such, (10) also represents a strict lower bound on P^. 
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For multiple trellis coding, wherein each branch in the trellis corresponds to 

more than one MPSK output channel symbol, the ’’length" of the shortest error 

event path is always equal to or greater than the number of branches along the 

shortest error event path. In view of (10), the possibility of a value of L 

greater than the length (in branches) of the shortest error event path is 

significant and what affords multiple trellis coding the opportunity of 

improving trellis coding performance on the fading channel. A simple example 

of this comment, which will be explored in a more general context later on in 

the report, pertains to trellis diagrams with parallel paths between states. 

b 

This occurs whenever 2 , i.e., the number of possible transitions from a 
given state, exceeds the number of states of the trellis diagram. In such 
cases, with conventional trellis coding, the minimum distance error event path 
is often the parallel path, i.e., the shortest error event path is of length 
one branch, and thus L = 1. With MTCM, we have the option of still having a 
trellis diagram with parallel paths, yet because of the multiplicity, we can 
have more than one nonzero pairwise Euclidean distance along that path; hence 
the opportunity of achieving a value of L greater than one. 

As a second example, consider the case of differentially coherent detection of 
MPSK (i.e., MDPSK) with no CSI and a Gaussian decoding metric. Although this 
metric is suboptimum for MDPSK, it was shown in [4] that it is considerably 
easier to implement than the true optimum metric and thus of significant 
practical interest. From Equations (25) and (26) of [4], the upper bound on 
pairwise error probability for the Rician channel is given by 


P(x •+ x) 


< n 

ncn 


1 + K 


1 + K + |x 
n 



G 

2\ zf- (1-4X) 
. 0 


(2X) 2 ( 1+K) 


x exp < 


e ; 

2A K r~ (1-4X) |x - x | 
Nq 1 n n 



r e 1 

1 + K + |x - x | 2 

2X ~ (1-4X) - (2 X) 2 (1+K) 

n n 

o J 


(ID 
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where X is a Chemoff bound parameter to be optimized. For the Rayleigh 
channel (K=0) , (11) simplifies to 


P(x -» x) < n 
ncn 


1 


1 +■ |x - X 

n n 1 

E 

2X j- 8 (1-4X) - (2X) 2 


0 


( 12 ) 


The bound in (11) cannot be optimized over X independent of the index n. On 
the other hand, the result in (12) can be optimized over X independent of 
n. In particular, differentiating the expression in brackets in (12) with 
respect to X and equating the result to zero gives the optimum Chemoff 
parameter for the Rayleigh channel, namely, 


E 

s 

4tr 


opt 


E 

1 + 2 — 

N o 


(13) 


For high SNR, (13) simplifies to 


X 

opt 



(14) 


Although (14) is not the optimum value of X for (11), we use it nevertheless 
(resulting in a looser upper bound) to arrive at a result in a desirable 
form. Thus, substituting (13) in (11) gives 


p (x -» x) < n 

nert 


1 + K 


1 + K + 


|x n - * 
16 


. \* f E 

n 1 2 _s - (1+K) 

L N o J 


x exp 


1 + K + 



16 



2 




- (1+K) 


) 
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= exp 



(15a) 


where, analogous to (5b), 


<T = 


E 

ncn 


1 

2 


X n - X n 

n n 


1 + 


i « 1 2 r s “i 

x - x E 

K + n - n 2 ~ - (1+K) 

16 L N o J 


+ 




1+K 


\ 


/ 


(15b) 


is a distance metric for E s /Nq > (l+K)/2. 

For sufficiently high SNR, (15) can be further approximated by 


P(x -* x) 


< n 

ncn 



1 



8(1+K) 


2 


-K 

e 


(16) 


Finally, using (16) in (1), we get 

i EX a(x,x)p(x) IJ — 1 r e K < 17 > 

x,x e ncn |x n - x n l 

\V 8(1+K) 

which is identical to (9) except for a scale factor and thus can be written in 
the form of (10). 

As a third example, consider once again the case of coherent detection of 
MPSK, now, however, with no CSI. From Equations (28) and (29b) of [1], the 
upper bound on pairwise error probability for the Rician channel is given by 
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P(x -» X) 



( 18 ) 


The expectation over the Rician probability density function of (3) is 
performed in [1], which reduces (18) to 


P(x -» x) < II exp lx 2 |x r - x n | 2 ) 
ner) ^ 0 ' 


x e 


' K [l - ^ /* n(e)exp(n 2 (e))erfc n(9) dej 


(19) 


where 


n(0) 


\|x - x r 

n n 1 



v'T+K 


- /K cos 0 


( 20 ) 


For sufficiently large E^N^, the first term of (20) dominates, in which 
case n(6) becomes independent of 0 and the evaluation of the integral 
becomes trivial. Thus, approximating (20) by its first term allows (19) to be 
simplified to 


P(x ■* x) < II expfx 2 ^ X n ~ X n^ ) 
ncrj \ 0 ' 


x e 


-K 


1 - /ir X 


|x n ' *n |2 (i 
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x exp 4 


_ . « |<c S 

X|x n - x n ST 


*erfclx|x - x 


n 


2N„ 


( 21 ) 


Furthermore, since for large E /N. we can use the asymptotic expansion for 

s u 

the complementary error function, namely. 


erfc x 


~ exp(-x ) 
✓? x 



( 22 ) 


then, using (22) in (21) gives the further simplification 


P(x ■* x) < 


x) < n 

ncn 


j 

/i \ 

„2, . ,4 / _S_ 

2X x - x I — 
n n \2N 


exp (x 2 ^ S Ix^ - x_ 


0 ncn 


n n 


G 


2L r 


> L " 2L '\t) JZ, 


x n - 


A I ' 

X n 


(23) 


where L is the "length" of the error event path corresponding to x, i.e., 
n 

the number of elements in n. 


The result in (23) can be optimized over the Chemoff parameter. Performing 
this optimization gives 



L 

_EL 


N 


0 ncn 



(24) 


which, when substituted in (23), gives the tightest upper bound on pairwise 
error probability, namely. 
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L 



Finally, substitution of (25) in (1) allows computation of the upper bound on 

bit error probability, which once again can be put in the form of (10) . where 

L would be the smallest value of L . The primary difference between this 

n 

third example and the previous two is the manner in which the constant C in 
(10) depends on the distance structure of the trellis code. More about that 
later on. 

An Example 

Consider a rate 1/2 trellis coded 4PSK modulation with the 2-state diagram 
illustrated in Figure 3. Each branch of this diagram is labeled with the 4PSK 
symbol transmitted when making that particular transition. The shortest error 
event path is of length 2 branches and both of these have nonzero distance 
with respect to the branches of the correct path (assumed to be the all zeros 
path). Thus, the "length" L of the shortest error event path is equal to two. 

For coherent detection without CSI, we need to compute the ratio of the sum of 
the squared branch distances to the product of the squared branch distances in 
accordance with (25). From Figure 3, the square of this ratio is easily 
computed as 


r ix - x i 

z -' n n 
nen 


n i*„ - v 

l nen 


(4+2) _ 36 _ _9 

= 64 = 16 

(4x2) 


(26) 



-14- 


Thus, letting L = 2 in (25), E = E. , and substituting (26) into this 

T) SO 

same equation gives the upper bound on pairwise error probability 


P(x -» x) < 


2 “2K-, lV .2 I ^ l x n 
e e (1+K) I ncn 


x | 

tl' 


*V 


V 

N. / 


n ix - 


incn 


n 


x | 

n 1 


„ 2 -2K,, „ x 2 
9e e (1+K) 

' l \\ 2 


(27) 


which, for large E. /N* , is also approximately equal to the upper bound on 

D 0 

bit error probability. Letting K=0 gives the identical result obtained in 
[1, Equation (67)) for the Rayleigh case. 

For coherent detection with CSI, we need to compute the product of the branch 
distances in accordance with (9). For the shortest error event path this 
product is easily computed as 


II |x n - x n | 2 =4x2=8 (28) 

ncn 

Thus, keeping only the term in (9) corresponding to the shortest error event 
path we get 


P 


b 


1 



n 

ncn 



[4 (1+K) ] 2 


2 


-2K 

e 



( 29 ) 
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which for K=0 agrees with [1, Equation (49*)]*. 

Finally, for differentially coherent detection with no CSI, (17) also requires 
calculation of the product of branch distances. Using (28) and again keeping 
only the term in (17) corresponding to this path, we get 


PbS /^ 

(M 

\ N 0/ 


-2K 


/= \2 n x - x 

ncn 

[ 8 ( 1 + K )] 2 


8e 2K (1+K) 2 
s \ 2 


/*b\ 

\ N o/ 


(30) 


which, for K=0, agrees with (56a) of [4]. 


B. No Interleaving/Deinterleaving 


If no interleaving/deinterleaving is employed, then the assumption that the 
fading is independent from symbol to symbol is no longer valid. In fact, if 
the fading is sufficiently slow as to be constant over the duration of a 
number of symbols equal to the minimum distance error event path, then for 
coherent detection with a Gaussian metric, the average bit error probability 
is asymptotically upper bounded by 


= C 1 exp | 


2.2 s 

—o d 

H free 4N, 


(31) 


where (^ is a constant, d£ ree is the squared free distance of the code, i.e., 


|x - x r 

* 4 1 n 1 


d . = min .... . 

free " n n 

ncn 


( 32 ) 


*Equation (49') of [1] should be corrected to read P = 


b ‘V^V 2 
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and the overbar denotes averaging over the Rician probability density function 
of (3). Performing this average gives 



1+K 

1 1+K +C 2 


exp 



1+K+C 




2 

free 



(33) 


which can be approximated for large E s /Nq b y 


P 


b 


1+K -K 
C =- e 


A 1 — 
free 


(34) 


where 


For differentially coherent detection with a Gaussian metric, the analogous 
result to (31) is obtained from [4; Equation (25)] and is given by 
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(35) 


where n corresponds to the dominant error event path. Again performing the 
average over the fading probability density function gives 
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which for large E s /Nq simplifies to 


P < 


min 


(1+K) e 


-K 


b ~ X 0 C 2 n t 1 -( 2 X 0 ) 2 , V i n |2] 

nen 


min 

c i 

(1+K) e K 

X. /E \ 



2X o (1 - 4 v(ii^ 

|Z 

lx -S | 2 n [1-<2X ) 2 |x -8 | 2 ] 


ncri 

ken 


k^n 


which, when optimized over the Chernoff parameter, can be put in the form of 
(34). 

i 

I Thus, for either coherent or differentially coherent detection, comparing (34) 

with (10), we observe that, with no interleaving/deinterleaving, independent of 
the trellis code , the asymptotic steepest rate of descent of with E g /N Q is 
inverse linear. 

I 

I 

i Multiple Trellis Coded Design for Fading Channels 

In this section, we expand upon the brief comments previously made about the 
suitability of using multiple trellis codes on the fading channel. In 
particular, we shall demonstrate that multiple trellis coding has the ability 
to produce a performance behavior that otherwise would not be achievable with 
conventional trellis coded MPSK of the same effective code rate, complexity 
(number of trellis states), and number of signal points M. 

Recall that with conventional trellis coding (i.e., one symbol per trellis 
branch) the length L of the shortest error event path is equal to the number 
of trellis branches along that path. Equivalently, if we assume that the all 

1 zeros path in the trellis diagram represents the transmitted sequence, then L 

is the number of branches in the shortest length path to which a non-zero MPSK 
symbol is associated. Since a trellis diagram with parallel paths is 


i 
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constrained to have a shortest error event of length one branch, we immediately 

have L = 1, i.e., the average bit error probability asymptotically varies 

inverse linearly with E /N_ . Thus, we conclude that for conventional 
s 0 

trellis coding on the fading channel, from an error probability performance 
standpoint, it is undesirable to design the code to have parallel paths in its 
trellis diagram. Unfortunately, however, for a conventional rate n/(n+l) 
trellis code, when 2° exceeds the number of states, one is forced into a 
trellis with parallel paths. Thus, in these instances, there is no choice but 
to accept an inverse linear asymptotic performance on the fading channel. 

When multiple trellis coding is employed, we regain the option of designing a 

trellis diagram with parallel paths yet still being able to achieve an 

asymptotic performance on the fading channel which varies inversely with E g /M 0 

at a rate faster than linear. The reason behind this lies in the fact that even 

if there exist parallel paths in the trellis, it is now possible to have more 

than one MPSK symbol with non-zero Euclidean distance associated with an error 

event of length one branch. In fact, even if the multiplicity, k, is equal to 

just two, as long as all of the pairs of MPSK symbols assigned to the parallel 

paths are not alike in either of the two symbol positions, i.e., they both 

represent non-zero Euclidean distances, then the pairwise error probability 

associated with that error event path will vary inversely with the square of 

E /JT . Thus, the primary objective for good multiple trellis code design 
s o 

on the fading channel is to maximize the number of symbols with non-zero 
Euclidean distance along the error event path of shortest length . A secondary 
objective is to minimize the constant C in (10), which, depending on the 
detection scheme (i.e., coherent or differentally coherent), requires either 
maximizing the product of the squared branch distances or maximizing the 
product of the squared branch distances each normalized by the square root of 
their sum along this shortest length path. 

The simplest way of illustrating the above considerations is with an example. 

In [5], we consider the design of rate 2/3 conventional trellis coded 8PSK 

systems for the AWGN. In particular, for the 2-state case, the trellis 

diagram is illustrated in Figure 4. Since for a rate 2/3 code there are 
2 

2 =4 possible transitions from each state to the next state, a 

conventional 2-state trellis must have 2 parallel paths between states. In 
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Figure 4, these parallel paths are labeled with the MPSK output symbol 
transmitted over the channel when that particular transition occurs. 

The performance of this conventional TCH scheme used on the Rician fading 
channel with coherent detection at the receiver is given in [3J. Because of 
the existence of parallel paths in Figure 4, this performance will 
asymptotically (for sufficiently large E /N rt ) vary inversely with E /N„. In 

SO SO 

* 2 

particular, since, for the parallel paths, |x r - x^l = 4, then, for example, 
for coherent detection with ideal CSI, the dominant term of (9) yields 


P 

b 



(38) 


where we have also noted that for a rate 2/3 code, E = 2E,. 

s b 

Now consider the rate 4/6 (=2/3), 2 state multiple (k=2) trellis code, with 
the trellis diagram illustrated in Figure 5. For this code b=4, s=6, and thus 
there are 2^ = 16 possible paths leaving each state. Since there are only 2 
states, each transition between states has 8 parallel paths. The sets of 8PSK 
symbol pairs for these transitions are illustrated directly on the branches of 
the trellis diagram and correspond to the signal points in the 8PSK signal 
constellation as shown. The construction of these sets is given below. 
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(39) 


First of all, we note that all of the parallel paths have a distinct pair of 

8PSK symbols which differ from each other in both symbol positions. Thus, in 

so far as single branch error events are concerned, the number of 8PSK symbols 

wth zero Euclidean distance from the correct path is two. Second, for an 

error event of length two branches, there are at least two out of the possible 

four 8PSK symbols that have non-zero Euclidean distance from the correct 

path. This is true for each of the 64 such possible paths. Finally, then, in 

accordance with the previous definition of L, the length of the shortest error 

event path is two, i.e., the asymptotic average bit error probability 

performance of this coded modulation scheme will vary inversely with the 

square of E g /NQ as desired. As a specific demonstration of this result, 

consider again the case of coherent detection with ideal CSI. If we 

arbitrarily take the all zeros path as being the correct one, then, for the 

two branch error event, the one parallel path in F that differs by one symbol 

from the correct path is [0 4] (or (4 0]), which has squared Euclidean 

distance 4. Similarly, the one parallel path in G that differs by one symbol 

from the correct path is [0 2] (or [2 0]), which has squared Euclidean 

distance 2. For the one branch paths in parallel with [0 0], the smallest 

2 

squared Euclidean distances occur for path [1 5], i.e., 4 sin (ir/8) and 
2 

4 sin (5*/8) whose product is less than (4) (2) = 8. Thus, the dominant 
term in (9) will correspond to the one branch error event, i.e., a parallel 
path, for which the average bit error probability is asymptotically 
approximated by 


P 


b 


1 

4 
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It goes without saying that the signal sets assigned to the trellis of 
Figure 5 will not produce optimum performance on the AWGN channel. We now 
investigate the extent to which that performance is degraded relative to the 
optimum assignment for the AWGN illustrated in Figure 6 (see [2], Figure 4a). 
Since on the AWGN channel asymptotic bit error probability performance is 
measured by the free distance of the trellis code, we shall now compare this 
quantity for the trellis diagrams of Figure 5 and 6. 

From Table 1 of [3], we find that d 2 =4 sin 2 (ir/4) + 8 sin 2 (ir/8) = 3.172 for 

tree 

the trellis code of Figure 6. The minimum Euclidean distance path for the 

trellis of Figure 5 also has length two branches. Then, since the minimum 

squared Euclidean distance between sets E and F and between sets E and G is 

2(4 sin 2 (ir/8) ) = 1.1715, we have that d 2 = 2(1.1715) = 2.343 or a penalty 

tree 

of 10 log i() (3. 172/2. 343) = 1.315 dB. 

As a second example, consider a 4 state, rate 4/5 multiple (k - 2) trellis 
code whose 5 output symbols are mapped into one QPSK symbol and one 8PSK 
symbol in each transmission interval in accordance with the generalized MTCM 
transmitter of Figure 1. The advantage of such a hybrid MTCM scheme over one 
whose multiple output symbols all come from the same alphabet is that the 
former is much less sensitive to carrier synchronization errors at the 
receiver. This stems from the fact that signal points in a QPSK constellation 
have greater distance between them than those in an 8PSK constellation and are 
thus less sensitive to phase jitter. Thus, one can derive the carrier 
reference necessary for coherent demodulation from only the received QPSK 
symbols. Potentially then, one can obtain an overall improvement in average 
system bit error probability performance relative to a 4 state, rate 
4/6 (= 2/3) coded 8PSK system also of multiplicity k = 2 and throughput 
2 bps/Hz, despite the fact that the latter would perform better in an ideal 
(perfect carrier synchronization) environment. 
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Figure 7 is an illustration of the trellis for the above hybrid scheme. Since 
b 

b = 4, there are 2 =16 paths emanating from each node. Thus with 
4 states, we assign 4 parallel paths between nodes and the trellis is fully 
connected. The construction of the signal sets for assignment to the branches 
of this trellis which produce optimum performance on the Rician fading channel 
is given below: 
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(41) 


For the above set assignment, the error event path with the shortest length is 
the parallel path. Since for each set of parallel paths, both symbol 
positions represent distinct assignments, i.e., nonzero Euclidean distance, 
then the length of this one branch path is L = 2 symbols and, from (10), the 
asymptotic behavior of the average bit error probability on the Rician fading 
channel varies as the inverse square of E^/N^. The minimum squared 
distance between the parallel paths is 4. However, the three branch error 
event path with signal set assignments, E, C, and G (see Figure 7) has a 

smaller squared distance equal to 2 + 0 + 4 sin2(ir/8) = 2.586 and thus we have 

dl =2.586. 
free 
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In 16], Ungerboeck considered a 4 state, rate 2/3 conventional (k = 1) trellis 

coded 8PSK system with 2 parallel paths between nodes (i.e., a half-connected 

trellis). For his scheme, the parallel paths represented the minimum distance 

2 

error event and it was found that d, =4. However, because of the absence 

free 

of multiplicity, if this code were used on the fading channel, it would have 
an asymptotic error probability performance that varied only inverse linearly 
with E s /N q . Thus, with the above hybrid MTCM scheme, we obtain a 
performance on the ideal AWGH channel inferior to that of the equivalent 
Ungerboeck code, with much improved performance on the fading channel and 
perhaps equivalent performance in the presence of imperfect carrier 
synchronization . 

Set Partitioning for Multiple Trellis Coded MPSK 

With the previous examples as a basis, we now describe a set partitioning 
method for the design of multiple trellis coded MPSK to achieve optimum 
performance on the Rician fading channel. In [7], Ungerboeck presented a set 
partitioning method for multiple trellis coding on AWGN channels. The method, 
which makes use of k-fold (recall that k denotes the multiplicity) Cartesian 
products of the sets found in Ungerboeck’ s original set partitioning method 
for conventional (k=l) trellis codes [6], is in essence the k-dimensional 
generalization of the latter. Since, as we have already observed, the 

criteria for designing optimum trellis codes on the fading channel are quite 

\ 

different from that for the AWGN channel (i.e., maximize d^ ree ) * one might 
anticipate that the set partitioning method would also be significantly 
different than that discussed in [7]. Indeed such is the case with the only 
common thread between the two being that we start the procedure with a k-fold 
Cartesian product of the complete MPSK signal set. The remainder of the 
procedure, ^long with the motivation for it, is described in what follows. 

For simplicity of explanation, we shall first focus our attention on the 
multiplicity 2 case. 

Let Aq denote the complete MPSK signal set (i.e, signal points 
0,1,2, ... ,M-1) and A^(x)Aq denote a 2-fold Cartesian product of A^ with 
itself. Thus, an element of the set A Q (x)A 0 is a 2-tuple whose first and 
second symbols are each chosen from the set A Q . The first step is to 
partition A. Q (x)A. Q into M signal sets defined by the ordered Cartesian 
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product* A q (x)B^; i = 0,1,2,... M-l where the jth element (j = 

0,1,2, ... ,M-1) of is defined by nj+i and the addition is performed modulo 
M. Thus, the jth 2-tuple from the product A q (x)B^ is the ordered pair 
(j,nj+i). The selection of the odd integer multiplier n is the key to the set 
partitioning method. Before presenting the relation whose solution provides 
the desired value(s) of n, we shall first discuss what the first partitioning 
step is trying to accomplish. 

The first partitioning step accomplishes two purposes. First it guarantees 
that within any of the M partitions, each of the two symbol positions has 
distinct elements. That is to say, for any 2-tuple within a partitioned set, 
the Euclidean distance of each of the two symbols from the corresponding 
symbols in any other 2-tuple within the same set is nonzero. We recall that 
this is the desired property from the standpoint of maximizing the ’’length" of 
the shortest error event path. Stated another way, if the shortest error path 
is of length 1 branch (i.e., parallel paths exist in the trellis and have the 
smallest Euclidean distance from the correct path), then the length L of this 
path is guaranteed to have value 2 and the error probability performance on 
the fading channel will vary as the inverse square of Eg/N^. 

The second purpose accomplished is that the minimum Euclidean distance product 
between 2-tuples within a partitioned set, i.e., the minimum of the product of 
the distances between corresponding symbol positions of all pairs of 2-tuples, 
is maximized. To determine the value of this distance, we observe that the 
set B^+l is merely a cyclic shift of the set B^ , i.e., a clockwise 
rotation of the corresponding signal points by an angle 2ir/M. Thus, since 
the squared Euclidean distance between a pair of 2-tuples is the sum of the 
squared Euclidean distances between corresponding symbols in the 2-tuples, the 
above set partitioning guarantees that the intra distance structure of all of 
the partitions A q (x)B^ is identical . Thus, it is sufficient to study the 
distance structure of Aq(x)b^, henceforth called the generating set . For 
this set, the product of the squared distances between the ith 2-tuple and the 
jth 2-tuple is 


*By ordered Cartesian product we mean the concatenation of corresponding 
elements in the two sets forming the product. 
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(< .in^)) (, Si „ 2 (^)) 


(42) 


Thus, based on the above requirement, we wish to choose n such that the 
2 

minimum of Ild.^ over all pairs of 2-tuples in A 0 (x)B q is maximized. 

Making use of the symmetry properties of the MPSK signaling set around the 
circle, we can write the above as follows. Letting n* denote the desired 
value(s) of n, then n* has the maximin solution(s). 


max 


f (n*) = * , - s 
n=l,3,5, 


min 

, M/2-1 m=l,2, . . . , M/2-1 


16 sin 


! /mir\ . 2 /n*mir\ 

(r) sin (— ) 


(43a) 


Equation (43a) has the equivalent vector form 


max 


min 


g(n*) = * 

n— 1 ,3,5, 


., M/2-1 m=l ,2 , . . . , M/2-1 


■ m _ | 
z - 1 


i n * m i i 
Z -1| 


(43b) 


where z = exp(j2ir/M) represents a unit vector with phase equal to that 
between adjacent points in the signal constellation. For M=2, we have the 
degenerate solution n*=l. 

Note that the additive inverse(s) of n*, i.e., M-n* is (are) also valid 

M-n 

solutions. This conclusion is easily derived by substituting z = 
exp( j2ir(M-n)/M) = z n for z° in (43b) and observing that the equation is 
unchanged . 

Table 1 gives the solution of (43) for M=4, 8, 16, 32, and 64. 


The sets obtained by this first partition are illustrated below for the case 
M=8 and n*=3 which is the single solution of (43). (Note that the additive 
inverse n*=5 could also have been used to generate (44).) 
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Note that sets A q0Bq* A^(x)B^, Aq(x)B^, Aq(x)B 6 of (44), which have the 
largest distance between them, i.e., the largest inter distance, are identical, 
respectively, to sets E, G, F, and H in Figure 5, where only 4 sets of 8 
elements each were needed and n*=5 rather than n*=3 was used. Equivalently, 
one could have employed sets A^^B^, A^(x)B^, A q (x)B 5 , and A Q (x)B^ in 
Figure 5. 

If one was to follow tradition, then the second step in the set partitioning 
procedure would be to partition each of the M sets A^(x)B^; 
i=0,l,2, . . . ,M-1, as in (44), for example, into two sets C q® d ^ 0 an< * 

(x) , with the first containing the even elements (j=0,2,4, . . . ,M-2) 
and the second containing the odd elements ( j=l ,3 ,5 , . . . ,M-1) . While it is 
true that for each of these partitioned sets, the elements in each of the two 
symbol positions would still be distinct, unfortunately, it is not always true 
that these sets have the minimum Euclidean product distance between 2-tuples 
maximized. Thus, we immediately conclude that the appropriate method to 
generate the sets on the second level of partition does not necessarily follow 
a tree structure. 

After a little thought, it becomes obvious that one should partition in such a 
way that the resulting sets (of dimensionality M/2) should have an 
intra distance product structure equal to that which would be achieved by a 
first level partitioning in accordance with (43) with, however, M replaced by 
M/2. Interestingly enough, this second level of set partitioning 
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can still be achieved by an odd-even split of a first level partitioning like 
that previously described; however, the value of n* used to generate the sets 
on this first level should be that corresponding to the solution of (43) with 
M replaced by M/2 (or its additive inverse) . Note that if one of the 
solutions of (43) (including the additive inverse) is equal to one of the 
solutions of (43) (again including the additive inverses) when M is replaced 
by M/2, then indeed the first two levels of set partitioning follow a tree 
structure . By inspection of Table 1, we observe that when M=4, 8, and 32, 
there exists a value of n* common to these values of M and the corresponding 
values of M/2. Thus, for M=4, 8, and 32 the first two levels of set 
partitioning follow a tree structure whereas, for M=16, they do not. 

As an example of the second level of set partitioning, the sets that result 
from the partitioning of the sets in (44) are given below 
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and the corresponding tree structure is illustrated in Figure 8. 

The third and succeeding steps are identical in construction to the second 
step, namely, we partition each set on the present level into two sets 
containing the alternate rows with the sets for the present level determined 
by a value of n* computed from (43) with M successively replaced by M/4, M/8, 
etc. 
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To extend the previous procedure to higher multiplicity of order k>2, we can 
simply form the k/2-fold ordered Cartesian product of all the sets on a given 
partition level created by the procedure for k=2. The result of this 
procedure is illustrated in Figure 9 for k=4. If the number of sets required 
to satisfy the trellis is less than the number of sets generated on a 
particular partition level, then one would choose those that have largest 
interd istance, as was done in the example of Figure 5. Also, as for the k=2 
case, the sets formed by this generalized set partitioning procedure will all 
have distinct elements in any of the k symbol positions. Thus, the length of 
a 1 branch error event path will have value k, and hence the asymptotic bit 
error rate performance of such a trellis code on the fading channel will vary 

inversely with (E s /NQ) n with n>k. Thus we can conclude that in so far as 
the rate of decay of average bit error probability with i^/N^ is 
concerned, incorporating multiplicity in the design of the trellis code has a 
similar effect to using diversity , a technique commonly employed to improve 
performance on fading channels. 

A more optimum procedure for k>2 would be to generalize (43b) to 
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The set of maximum solutions n^,n 2 » * • • » n j t _j can used to produce all 
of the necessary sets on any level of partition. 


More Examples 

1. 4 State Rate 4/6 Trellis Coded 8PSK 

Consider a 4 state rate 4/6 trellis coded 8PSK system designed for optimum 
performance on the Rician fading channel. The trellis diagram appears as in 
Figure 7, where the signal point sets assigned to the branches are derived 
from the previous procedure and are given by 
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For this assignment, each set has a minimum squared intrad istance 4+4 =8, 
which represents the minimum square Euclidean distance between parallel 
paths. Each of these one branch paths when viewed as an error event path has 
a length L=2 with respect to any of the other paths in parallel with it. 

Every other error event path (consisting of two or more branches) has a length 
L greater than two regardless of which path is chosen as the correct path. 
Thus, the dominant term in the asymptotic bit error probability expression of 
(9) corresponds once again to the parallel paths. Since the minimum squared 
intrad istance for each of the two symbols in any of these 2-tuples is 4, then 
analogous to (40), which describes the performance of the same scheme using 
only a 2 state trellis, we get 


p ~ I / Ml+K) -K 

b ' 4 i 

\ “o 

_ 1_ ( l+K) 2 -2K 

" 16 /5 x 2 

(a 

i.e., a gain of 4.5 dB in SNR. 

2. 2 State Rate 4/12 Trellis Coded 8PSK 
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This is an example of a multiplicity 4 trellis code optimally designed for the 
Rician fading channel. Since 4 8PSK symbols are transmitted over the channel 
for each 4 bits into the encoder, the throughput of the code is 1 bps/Hz. The 
state diagram is as in Figure 5, where E, F, G, and H are chosen as those sets 
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which have largest inter distance in the construction of Figure 9. As such, we 
have 


E = A o 0 B o® A o® B o 


G - A 0 ® B 2 ® A 0® B 2 


0 0 0 0 


0 4 0 4 

15 15 


1111 

2 2 2 2 


2 6 2 6 

3 7 3 7 

4 4 4 4 

F " A o® B 4® A 0® B 4 = 

3 3 3 3 

4 0 4 0 

5 15 1 


5 5 5 5 

6 6 6 6 


6 2 6 2 

.7373. 


_7 7 7 7_ 

"0202“ 


"0606“ 

17 17 


13 13 

2 4 2 4 


2 0 2 0 

3 13 1 

4 6 4 6 

H = A 0® B 6® A 0® B 6 = 

3 5 3 5 

4 2 4 2 

5 3 5 3 


5 7 5 7 

6 0 6 0 


6 4 6 4 

.7575. 


.7171. 


For this code all sets have squared Euclidean intradistance equal to 8. The 
asymptotic average bit error probability for coherent detection with ideal CSI 
is computed analogous to (40) and is given by 


P. = 


1 /411+Ki e ~K) 
4 | E^ 


. • 2 Mf (a • 7 ‘~ ^ 2 

|4 sin (-1) |4 sin 


. 2 1 5ir\V 
!in iry/ 


,, -4K 

(1+K) e 

far 


(50) 


Note that because the multiplicity is equal to 4, the average bit error 
probability of (4) varies inversely with (E|j/No)4, where now E s = E]j. 


3. 4 State Rate 5/6 Trellis Coded 8PSK 

This is an example of a multiplicity 2 trellis code with noninteger throughput 
(i.e., 2.5) optimally designed for the Rician fading channel. The trellis 
diagram is as in Figure 5 with the following set assignments: 
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C = A 0 @B 0 
E = A 0 ®B 4 
G = A 0 ® Bl 

I = a 0 ®b 5 


D = A 0 ®B 2 
F = A 0 ®B 6 
H = A 0 ®B 3 

J = a 0 ®b 7 


(51) 


where the sets A^®B^; i = 0,1,2. ..,7 are as in (44). Again we remind 

the reader that n* = 5 rather than n* = 3 could have been used to generate 

these sets. By construction, the parallel paths in each of the above sets 

have length L=2. Also, the minimum square Euclidean distance product for 

these parallel paths is nd 2 = (4 sin 2 (ir/8)) x (4 sin 2 (5ir/8)) = 2. If we 

examine all of the two branch error event paths, we find that the shortest 

length of these paths is also L=2. The minimum squared Euclidean distance 

2 2 

product for these two branch paths is (4 sin 2(ir/8)) x (4 sin (ir/8)) = 1.172, 
which is smaller than 2 and thus dominates the asymptotic error probability 
performance. In particular, for coherent detection with ideal CSI, we have 
that 



where E s = 2.5 E*,. 

The squared free distance of this code is determined by the error event path 

2 

of length three branches indicated in Figure 5 and is given by d. 

2 2 tree - 

2(4 sin (ir/8)) + 4 sin (ir/8) = 1.757. The equivalent code optimized for the 

AWGN [8] has df ree = 2 but only L=l. 

4. 8 State Rate 3/6 Trellis Coded 8PSK 

The last example is another one with noninteger throughput (i.e., 1.5). Also, 

3 

since there are 2 =8 branches emanating from each node and the trellis has 
8 states, there are no parallel paths and the trellis is fully connected. The 
trellis diagram is illustrated in Figure 10 with the following set assignments: 
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A (0) = A 0 @B 0 

- (5: 
B (0) = Aq®B 4 

.1 » (i) J 0 <i) . . _ ,. .... , .(0) (0) 

Also A and B ; l = 1,2,3 are cyclic shifts of A and B , 

respectively, by i rows. This code achieves a minimum diversity L=3 

corresponding to an error event path of length 2 branches. Also, the minimum 

2 2 

product of squared Euclidean distances is given by lid = (4 sin (ir/8)) 

2 

x (4 sin (5ir/8)) x 4 = 8, which corresponds to the error event path 
(relative to the all zeros path) with 8PSK symbols (1,3) and (0,4) along its 
branches. Finally then, the bit error probability is asymptotically 
approximated by 


P 


b 




1 

8 


0.79 


x, „,3 -3K 
(1+K) e 



(54) 


where E s /Nq = 1.5 E^/No- The squared free distance is determined by 

the two branch path with 8PSK symbol assignments (3,1) and (1,7) and is given 
2 

by d free = 4 sin2(3*/8) + 4 sin2(w/8) = 5.172. 


Another interesting generalization of this example is as follows. When there 
are no parallel paths in the trellis, as is true here, it may be desirable to 
go to a larger modulation constellation (e.g., 16PSK rather than 8PSK) to 
achieve an increase in diversity. To demonstrate this idea, consider the 
following set assignment to the trellis diagram of Figure 10. First construct 
sets A^(x)B q and A 0 (x)B^ with n* = 3 and for 16PSK. Next, partition 
these sets in accordance with Figure 9. Now choose the sets in Figure 10 as: 


A 


( 0 ) 



0 0 


” 1 7 

2 6 


3 13 

4 12 


5 3 

6 2 
8 8 

• B<0> - * 

7 9 
9 15 

10 14 


11 5 

12 4 


13 11 

14 10_ 


15 1 


( 55 ) 
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together with the appropriate cyclic shifts as before. We note that with this 
assignment, all of the two branch error event paths differ from the correct 
path in four 16PSK symbols and thus the diversity is now L=4. Unfortunately, 
the minimum product of squared Euclidean distances will be reduced to the 

value nd 2 = (4 sin 2 (2*/16) ) x (4 sin 2 (10ir/16)) x (4 sin 2 (ir/16)) 
x (4 sin 2 (9*/16)) = 1.172 and thus the choice between 8PSK and 16PSK 
modulations depends on the value of E])/Nq one is operating at. 
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Appendix A: 


Proof that d 2 Defined in (5b) Satisfies the Conditions 
for a Distance Metric 


Theorem : Let K > 0, y > 0, and 6 be a metric. Then 


d(x 


,y> = (— Juhbll 1 1 ln ( l * * X, i r> ) ) 

\1 + K + y 6 2 (x,y) Y \ // 


1/2 


(A-l) 


is a metric, where 


< 2 (x,y) = | x-y | 2 


(A-2) 


Proof: 


Letting 


4 o (x,y) = l+ic * 2(x » y) 


(A-3) 


then, we can rewrite (A-l) as 


d(x,y) 


1 

/r 


K (x,y) 


+ ln 


1 + $ 0 (x,y) 


|l + 6 Q 2 (x,y)j 


1/2 


(A-4) 


Since multiplication of a metric by a constant does not change its metric 
status, it is sufficient to show that* 


d(x,y) = 


K * 2 (x, y) 

1 + 6 2 (x,y) 


+ In 


(l + 6 2 



1/2 


(A-5) 


is a metric. 


*For simplicity of notation, we shall drop the zero subscript on 6q. 


-36- 


Consider a function $(t) defined over the domain R + (the set of positive 
real numbers) and taking on values over R + . Let <|>(t) have the following 
four properties: 


( 1 ) <|>( 0 ) > 0 

(2) 4><t) > 0 

(3) <t>'(t) > 0 (A-6) 

(4) 4>"(t) < 0 

We shall now show that <|>(6(x,y)) is a metric. We observe that (A-6) 
implies that <|>(t) is a monotonically increasing function with decreasing 
slope as t increases. From conditions (l)-(3) of (A-6), we have that (see 
Figure A-l) 

<|>(t 3 ) < <D(t 1 +t 2 ) for t 3 > t 1 +t 2 (A-7) 

Further imposing condition (4) of (A-6) results in 

♦(t +t > - 4>(t ) 

r <**<V 


4>(t ) 

> 4>’(t 2 ) > «|»’(t 1 ) 


(A-8) 


Combining (A-7) and (A-8), we have that 




(A-9) 


Letting 


t 1 = 4(x,y) 

t 2 = A(y,z) (A-10) 

t 3 = 6(x,z) 


then for the triangular inequality on 6, namely 
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6(x,z) > 6(x,y) + 6(y,z) (A-ll) 

we have from (A-9) that $(4) satisfies the triangular inequality 


4>(4(x,z)) < 4>(6(x,y)) + 4>(4(y,z)) (A-12) 

and is thus a metric. 

It remains to show that 4>(6(x,y)) = d(x,y) satisfies the conditions of 
(A-6 ) . 


Letting 


4>(t) 



+ In (1 + 



CA-13) 


we immediately observe that conditions (1) and (2) are satisfied. The first 
derivative of $(t) is given by 


= t ( 1+K+t 2 ) 
<|.Ct) (1+t 2 ) 2 


(A-14) 


which is obviously greater than zero for all t greater than zero. 
(Condition (3) is satisfied.) Finally, the second derivative of <(>(t) is 


<J>"(t) = 


l+K-3Kt 2 -t 4 

(1+t 2 ) 3 


2 2 2 
_1 t (1+K+t ) 


2 2 4 

♦ (t) (1+tV 


(A-15) 


2 2 

Since In (1+t ) < t , we have from (A-13) that 


* 2 (t) < 

1+t 


(A-16 ) 



-38- 


Combining (A-15) and (A-16) gives 


< - 


(3K+l)t 2 + t A 

(l + t 2 , 3 


(A-17 ) 


which obviously satisfies condition (4). Q.B.D. 
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Table 1 

The Solutions of Equation (43) for Various Values of M 


M 

n* 

2 

1.1 

4 

1.3 

8 

3,5 

16 

7,9 

32 

7,9,23,25 

64 

19,27,37,45 


DIFFERENTIAL 

ENCODER 
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Figure 1. Block Diagram of the Trellis Coded System 
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SYMMETRIC 4-PSK 


(b) 



Figure 3. (a) Symmetric QPSK Signal Point Constellation, 

(b) Trellis Diagram for Conventional Rate 1/2 
Trellis Coded QPSK 
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0,0 

1,5 

2,2 



Figure 5. Trellis Diagram for Multiple (k = 2) Rate 2/3 Coded 
8PSK; 2 States 
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n n n 



Figure 7. 4-State Trellis for Rate 4/5 Hybrid QPSK/8PSK 
Multiple TCM 



Figure 8. Set Partitioning Method for Multiple (k = 2) Trellis 
Codes on the Fading Channel 
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Figure A-l . An Example of a Function that Satisfies the 
Conditions for a Metric 



